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Abstract 

Algorithms to generate various combinatorial structures find tremen- 
dous importance in computer science. In this paper, we begin by reviewing 
an algorithm proposed by Rohl [24] that generates all unique permutations 
of a list of elements which possibly contains repetitions, taking some or all 
of the elements at a time, in any imposed order. The algorithm uses an 
auxiliary array that maintains the number of occurrences of each unique 
element in the input list. We provide a proof of correctness of the algo- 
rithm. We then show how one can efficiently generate other combinatorial 
structures like combinations, subsets, n-Parenthesizations, derangements 
and integer partitions & compositions with minor changes to the same 
algorithm. 

1 Introduction 

Algorithms which generate combinatorial structures like permutations, combi- 
nations, etc. come under the broad category of combinatorial algorithms [TB]- 
These algorithms find importance in areas like Cryptography, Molecular Bi- 
ology, Optimization, Graph Theory, etc. Combinatorial algorithms have had 
a long and distinguished history. Surveys by Akl p], Sedgewick [27], Ord 
Smith [2U EE] , Lehmer [18] , and of course Knuth's thorough treatment [T2] IT3] , 
successfully capture the most important developments over the years. The lit- 
erature is clearly too large for us to do complete justice. We shall instead focus 
on those which have directly impacted our work. 

The well known algorithms which solve the problem of generating all n\ per- 
mutations of a list of n unique elements are Heap Permute [9], Ives [11] and 
Johnson- Trotter [29] . In addition, interesting variations to the problem are - 
algorithms which generate only unique permutations of a list of elements with 
possibly repeated elements, like [31 13 US], algorithms which generate permuta- 
tions in lexicographic order, like [50] EM HE] and algorithms which genenerate 
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permutations taking r(< n) elements at a time. With regard to these variations, 
the most complete algorithm, in our opinion, is the one proposed by Rohl [24] 
which possesses the following salient features: 

1. Generates all permutations of a list considering some or all elements at a 
time 

2. Generates only unique permutations 

3. Generates permutations in any imposed order 

It is of course possible to extend any permutation generating algorithm to ex- 
hibit all of the above features. However, when such naive algorithms are given 
inputs similar to aaaaaaaaaaaaaaaaaaao^i, the number of wasteful computations 
can be unmanageably huge. Clearly, the need for specially tailored algorithms 
is justified. 

We provide a rigorous proof of correctness of Rohl's algorithm. We also show 
interesting adaptations of the algorithm to generate various other combinatorial 
structures efficiently. 

The paper is organised as follows: we begin by introducing terminologies 
and notations that will be used in the paper in Section [2j and presenting Rohl's 
algorithm in Section [3] Section @] contains a detailed analysis of the algorithm, 
which also includes the proof of correctness of the algorithm (missing in Rohl's 
paper). We then present various efficient extensions of Rohl's algorithm in 
Section [5J to generate many more combinatorial structures. 

Rohl's algorithm involves two stages. The first stage builds up an auxiliary 
array that maintains the number of occurrences of each unique element in the 
input list. The second stage uses this auxiliary array to effect and generates 
the required combinatorial structure. We present a recursive representation of 
the algorithm. It would be quite easy to transform the recursive version to an 
iterative one [23] . 

2 Preliminaries 

In this section, we introduce the key terms, concepts and notations that will be 
used in the paper. 

2.1 Basic Definitions 

Set: A collection of elements without repetitions 
List: A collection of elements possibly with repetitions 

Permutation: A permutation of a list is an arrangement of the elements of 
the list in any order, taking some or all elements at a time 
E.g: Consider a list {a, 6, c, d}; Various permutations are - {d, a, b, c}, {a, d, c}, 
{b,d,a}, {a,d}, {d,c}, {b,a} 

1 There are 20 unique permutations of length 20 
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r-Permutation Set: The r-permutation set of a list is defined as the set of 
all possible r-permutations of the list, where an r-permutation is defined as a 
permutation taking exactly r (1 < r < size(list)) elements at a time 
E.g: Consider a list {a, b,c}. We get: {a,b,c}, {ab,ac,ba,bc,ca,cb} and {abc, 
acb, bac, bca, cab, c6a}0 as the 1-permutation set, 2-permutation set and 3- 
permutation set, respectively. 
The algorithm takes as input: 

• A list of n elements to permute, called input list: C — {l\, I2, ■ • • , l n } 
containing p(< n) unique elements 

• An integer r, 1 < r < n, which denotes the number of elements to be 
taken at a time during permuting 

• A set of p elements which are actually the p unique elements of C in a 
particular sequence, called Order Set (or just Order): O = {01, o 2 , . . . , o p } 

It produces as output the r-permutation set of C, which is a set of m r- 
permutations: V r (C) — {V{, V%, . . . where each V\ is in turn is a list of 

r elements: V\ — {p r n, ■ ■ ■ ,Pi r } V (1 < i < m). The algorithm works such that 
P r (L) follows the order O (definition and explanation in Section |2?2|) . Quite 
obviously, p\^ £ L V(l < i < m), (1 < j < r). Note that we use the terms 
r-permutation (set) and permutation (set) interchangeably, if clear from the 
context. 

2.2 Order of Permutation Set 

Consider the 3-permutation set of the list {a, b, c} : {abc, acb, bac, bca, cab, 
cba}. We see that the permutations are in lexicographic order, i.e they follow 
the order {a,b,c}. At the same time, the 3-permutation set {bac, bca, abc, acb, 
cba, cab} follows the order {b,a,c}. We shall now introduce a formal definition 
of what we mean when we say P r (C) follows the order O. 

Discriminating Index: The discriminating index between two different r- 
permutations is the first (minimum) index in both permutations at which the 
elements differ from each other. Given two permutations V r a and VI , we denote 
the discriminating index as 

V(V r a ,V r b )= min{x : p r ax ^ plJ 

Also, assume that Xo{Pij) denotes the index of the element in O. Thus 
Zoiplj) = x o x =p r tj . 

V r (C) is said to follow an order O if: 

• V (1 < i < m), (1 < j < r) p\ 3 e O 

2 Each permutation in an r-permutation set is actually a list of elements. For compactness, 
we represent the elements in concatenated form. 
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• For every pair of permutations in V r (C), the elements at the discriminat- 
ing index have the same positions relative to each other in O, as do the 
pair of permutations in V r (C) 
V(l<a<6<m): T (p r ad ) <Io{p r bd ) 

where d = T>(V^,Vl). It has to be noted that similar definitions hold for 
both sets of combinations and derangements too. 

2.3 Auxiliary Array 

Central to the algorithm is an auxiliary array denoted as Count Array. It is an 
array of integers built in such a way that Count Array[i] gives the number of 
occurrences of element Oi in the input list C. CountArray is of size p as it is 
maintained parallel to O. 

As an example, consider C — {a,c,b,a,a,c}, O = {c, a, b}; here n = 6 and 
p = 3. We have o\ (= c) occurring twice, 02 (= a) occurring thrice and 03 (= 
b) occurring once. Thus we must have CountArray — {2,3, 1}. Instead, if we 
had O — {a, 6, c}, we would have CountArray = {3, 1, 2}. 

3 The Algorithm 

In this section, we introduce the algorithm. The algorithm takes £, r and O 
as input and produces V r (C) as output, while ensuring that V r {C) follows O. 
The first stage involves building CountArray while the second stage involves 
recursive generation of V r (C) using CountArray. 

3.1 Building CountArray 

The first stage involves building CountArray parallel to O in a way that 
CountArray[i] is an integer which represents the number of occurrences of o,; 
in C. Procedure Q] is a pseudocode representation of how to build CountArray, 
given C and O. We assume the existence of the function Xq which takes an 
element of C as a parameter and returns the position of the element in O. We 
can say Io{h) = x o x = l{. A naive implementation of Xq would be to 
perform a linear search over O, and this would have a worst case runtime of 
0(p). However, if needed, we can achieve O(l) runtime using hash functions. 
We leave the implementation details to the reader. 

3.2 Generating Permutations 

Procedure [1] builds CountArray. The second stage involves using the recursive 
routine APR to generate V r (C), following order O, as output. To this end, we 
use an output array 1Z. 

Procedure [5] is a pseudocode representation of APR. It assumes non-local 
existence of the following: CountArray, O, r, p and 1Z. Also, it has a lo- 
cal parameter - an integer index. Thus the function's header takes the form 
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Procedure 1 : Building Count Array 
1: Input: C and O 
2: Output: CountArray 

3: Count Array[l ...p] ^ 

4: for i <- 1 to n do 

5: pos Io{h) 

6: Count Array\pos] <— Count Array[pos] + 1 



APR(mefex). It is invoked by the function call APR(l). As APR recursively 
calls itself (1 < index < r), TZ[1, ...,r] is populated using CountArray. 



Procedure 2 : APR(mdex) - Permutations 

1: Local: index 

2: Global: CountArray, O, r, p and 7?. 

3: if index > r then 

4: Print ... ,r] 

5: return 

6: else 

7: for i 1 to p do 

8: if Count Array[i] > 1 then 

9: lZ[index] <— Oi 

10: Count Array[i] Count Array [i] — 1 

11: APR(inciex + 1) 

12: CountArray[i] <— Count Array[i] + 1 



4 Algorithm Analysis 

In this section, we begin by explaining how APR works and showing the recur- 
sion tree for a particular example. We then give a formal proof of its correctness. 
We conclude by establishing an upper bound on the runtime of APR. 

We know that the contents of CountArray are constantly changing. Given 
a particular index in 1Z we assign to it an element o, £ O (1 < i < p) only if 
Count Array[i] > 1, at that particular time. We call the assignment minimal if 
we choose the Oj corresponding to the minimum i for which Count Array[i] > 1. 
More formally, we can say that an assignment of oi to a particular index in 1Z 
is minimal iff $ o x £ O; 1 < x < i such that Count Array [x] > 1. Also we 
say populates lZ[i . . .j] minimally to signify that minimal assignment is done at 
every index from i to j (inclusive). 
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4.1 Recursion Tree 



Let us analyse how APR functions by referring to Procedure [5] The function, 
at any recursion level, begins by searching for the first available element, i.e. 
the first index i at which Count Array[i] > 1 (lines 7-8). Once found, the 
corresponding element (oj) is assigned at the current index in 1Z (line 9). It 
then decrements Count Array[i] to reflect the fact that that particular element 
was assigned (line 10). After that, there is a recursive call and the control 
moves to the next recursion level and deeper thereafter performing the same 
set of functions (lines 7-10). Once the control returns to the current recursion 
level, it de-assigns the element that was assigned to the current index in TZ, 
by incrementing CountArray[i] (line 10). It then moves to the next available 
element in Count Array and performs the process of assigning and recursion all 
over again. 



CA = {2,2} 
K= {} 
APR(l) 



index— 1 
CA = {1,2} 

n = {i} 

APR(2) 




index— 1 
CA = {2,1} 
U= {2} 
APR(2) 



index— 2 
CA = {0,2} 

n = {1,1} 

APR(3) 

index— 3 
CA = {0,1} 
U = {1, 1,2} 
APR(4) 

index— 4 




index— 2 
CA = {1,1} 

n = {1,2} 

APR(3) 



index— 2 
CA = {1,1} 

n = {2, i} 

APR(3) 




n = {i, 1,2} 



index— 3 
CA = {0,1} 
TZ = {1, 2, 1} 
APR(4) 
I 

index— 4 



index— 3 
CA = {1,0} 
Tl = {1,2,2} 
APR(4) 

index —4 



index— 3 
CA = {0,1} 
U = {2, 1, 1} 
APR(4) 
I 

index— 4 



=3 

CA = {1,0} 
TZ = {2, 1, 2} 
APR(4) 
I 

index— 4 



index— 2 
CA = {2,0} 
TZ = {2,2} 
APR(3) 

index— 3 
CA = {1,0} 
TZ = {2, 2, 1} 
APR(4) 

index —4 



7J= {1,2,1} H = {1,2,2} TZ={2,1,1} 7J={2, 1,2} 



n = {2, 2, 1} 



Figure 1: Recursion Tree of APR for C = {1, 1, 2, 2}; r = 3; O = {1, 2} 



We venture an example to better illustrate the process. Assume we have 
C = {1,1,2,2}, r = 3 and O = {1,2}. When C and O are fed to Proce- 
dure[TJ we would obtain CountArray[l] = CountArray[2] — 2. Procedure [5] is 
then invoked. The recursion tree that would be obtained is shown in Figure [T] 
(Count Array is abbreviated to CA) . The root of the tree represents the initial 
state, i.e. CountArray = {2,2} and TZ = {}. Every descendant of the root 
represents a particular recursion depth, indicated by the value of index. At ev- 
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ery node, we indicate the values in Count Array and 1Z just before going down 
to the next recursion level followed by the function call that initiates the next 
recursion level. Once we go four levels deep into the recursion, i.e. index = 4, 
the contents of 1Z would be output because index > r (lines 3-4 of Procedure [5]). 
The control then returns to the previous recursion level (line 5). We observe 
that V 3 (C), i.e. the collection of all leaves of the recursion tree from left to 
right, follows the order {1,2}. 

4.2 Proof of Correctness 

In this section, we shall use Vi to denote a general r-permutation instead of 
V\. Also, we shall use Vi[x], instead of pi x , to denote the x th element of the i th 
permutation. 

Theorem 1. The algorithm generates only valid permutations of C 

Proof. We assign an element Oi to any particular index in 1Z only if we have 
Count Array[i] > 1. Also, we decrement or increment Count Array[i] whenever 
Oi is assigned or de-assigned, respectively. This process ensures that APR only 
assigns elements that were actually present in £ and no element is assigned more 
times than it occurs. Hence we can say that every permutation that is output 
is valid. □ 

Theorem 2. The algorithm generates only unique permutations of C following 
order O. 

Before we prove Theorem [21 we must understand how the algorithm gener- 
ates a permutation subsequent to an already generated one. Once a permutation 
is output, the program searches backwards from the end of 1Z for an index where 
it can assign, from the available elements, an element that has a higher position 
in O than the previously assigned element. This means that, if o x were assigned 
at a particular index, the algorithm checks if it can assign any among o x +\ . . . o p 
at the same index. At the first instance (referred to as Property 1) of such an in- 
dex, the algorithm assigns the first among (referred to as Property 2) o x+ ± . . . o p , 
whichever is available. Note that this index would be the discriminating index 
between the previously output permutation and the next permutation that is 
going to be output. Once this is done, the program populates the rest of 7Z 
minimally (referred to as Property 3). 

Proof of Theorem^ This process ensures that, between two consecutive per- 
mutations, we will have at least one index where the elements differ. Given 
that at this index, i.e. discriminating index, we assign one of o x +\ . . . o p , we can 
be sure that the permutation generated subsequent to a particular permutation 
will occupy a higher position in V r (C) (when it follows order O). □ 

Lemma 3. Given three consecutive permutations Vi, Vj and Vk, we have: 
V{Vi,Vj)>V{Vi,Vk). 
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Proof. This shall be proved by contradiction. Let us assume we have Vi , Vj , Vj 
such that 

dij < d lk (1) 

where dij,dj k ,d ik represent each of T>(Vi,Vj), V(Vj,V k ), T^{Vi,V k ), respec- 
tively. By the definition of discriminating index in Section 12. 2[ we have the 
following equations. 

r i [i...(d ij -i)] = v j [i...(d ij -i)} (2) 

lo(V l [d lJ })<l (V J [d lJ }) (3) 

and 

V i [l...(d ik -l)] = V k [l...(d ik -l)} (4) 

To{Vi[d ik ]) <lo{V k [d ik \) (5) 

From equations [T] and [4l we can say V k [dij] = Vi[dij\. By using this in equa- 
tion [3J we get 

loiVM,]) KloiV^]) (6) 

Also, from equations and S] and equation [TJ we get 

V k [l...(d ij -l)]=V j [l...(d ij -l)] (7) 

Equations [6] and [7] imply that V k comes before Vj , when order of consideration 
is O, which is a contradiction to the order of the permutations. This in turn 
implies that the initial assumption is wrong. □ 

Corollary 4. V{Vj,V k ) > V(V h V k ). 

Proof. Equations [5] and 0] follow from the definition of discriminating index. 
From Lemma [3] we have > di k , thus we get 

V k [l...(d ik -l)]=V j [l...(d ik -l)} (8) 

Thus the corollary follows. □ 

Lemma 5. Given two successive permutations that APR generates, sau_P a and 
Vb, $V X € {V r {C)/{V a ,Vb}): such thatV x occurs between V a andVbU 

Proof. This shall be proved by contradiction. Assume there exists such a V x - 
Let us take d ab = V(V a ,V b ), d ax = V{P a ,P x ) and d xb = V(V x ,V b ). By 
the analysis of how APR generates a permutation subsequent to an already 
generated one 

d ax > d ci b [From Property 1] (9) 
d x b > dab [From Property 3] (10) 



3 A/{Ai, A2} Denotes A excluding Ai and A2 



8 



Also we have d ax d ab from Lemma [3] and d xb <^ d ab from Corollary HJ Thus 
d ax = dbx = d ab (~ A say) is the only possibility. 

For V x to be between V a and V b : l {V a [\\) < X {V X [X]) < l {V b [\]). This 
is a direct contradiction to Property 2. Thus we can have no V x in between V a 
and V b . □ 

Theorem 6. The algorithm generates all r -permutations of C. 

Proof. We begin by proving that APR correctly generates the first permuta- 
tion. By Lemma [SJ we have proved that the process of generating the next 
permutation given the current one is accurate in that the immediate next per- 
mutation, in accordance with the order O, is generated. We then prove that 
APR terminates once all permutations are generated. 

At the beginning of the algorithm, we perform a minimal assignment at the 
first index, i.e. index — 1. The program then moves down a recursion level 
and once again performs a minimal assignment. This process continues until 
the last recursion level. In the (r + l) th recursion level, the program outputs 
the permutation. Clearly the first permutation that would be output by APR 
would be formed by populating 1Z\Y . . . r] minimally. 

The program control returns from a level of recursion only after looping in 
[l,p] is complete, i.e. when there are no more available elements that can be 
allotted at that corresponding index in 1Z. We know that the range [l,p] is 
finite because the size of C is finite. Also the maximum recursion depth of the 
program is finite because r is finite. This would ensure that every recursion level 
would eventually end which in turn ensures that the program will eventually 
terminate. □ 

4.3 Running Times 

To establish an upper bound on the running time of APR, let us analyse the 
recursion tree shown in Figure [TJ We see that the number of leaves are exactly 
equal to the number of unique r-permutations of £, i.e. |'P r (£)|. Each of the 
leaves are exactly (r + 1) levels below the root. The critical operations, i.e. 
looping and searching for available elements, are done on the first r levelsQ At 
each level, we loop in the range [l,p]. Thus we can say that the running time 
of APR is bounded by O (\V r (C)\ x r x p) in the worst case. 

It is to be noted that the runtime of Procedure Q] has been ignored in com- 
parison to the runtime of Procedure [5] 

5 Extensions 

In this section we show how the smallest of changes to Procedure [2] helps us 
generate many more combinatorial structures. Please note that the proof of 
correctness of each of the following algorithms follows trivially from the rigorous 
analysis in Section l4~2l 

4 In the (r + l) th level, the permutations are output. 
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5.1 Derangements 

A derangement of a list is a permutation of the list in which none of the ele- 
ments appear in their original positions. Some previous work on derangement 
generating algorithms can be found in P21 IIH] • 

With just two changes to Procedure[5J we are able to generate derangements 

• Non-local existence of £, in addition to the other entities, is required 

• Before assigning an element at a particular index, we perform an additional 
check to ensure that the same element does not occur at the exact same 
index in C, i.e., if we were to assign Oi to Tt[index], we would need to 
ensure that l mde x ^ °i 

The procedure shall be referred to as APR2 (shown in Procedure [3]) . It is 
invoked by the call APR2(1). All we do is perform an additional check to ensure 
that the element being assigned at the current position does not appear in C at 
the same position (Line [8J . Count Array is built exactly as was illustrated in 
Procedure [TJ 

Extending the algorithm to generate partial derangements, like in |14j . is 
easily done. Figure [5] shows the recursion tree when we generate derangements 
of C = {1, 2, 3}; r — 3 and O = {1, 2, 3}. It is to be noted that the program can 
output r-derangements, as well all derangements following an imposed order. 



Procedure 3 : APR2 {index) - Derangements 

1: Local: index 

2: Global: CountArray, O, r, p and 1Z 

3: if index > r then 

4: Print H[l, r] 

5: return 

6: else 

7: for i <!— 1 to p do 

8: if CountArray[i] > 1 and kndex ^ °i then 

9: lZ[index] <— o L 

10: Count Array[i] <— Count Array[i] — 1 

11: APR2(index + 1) 

12: Count Array[i] Count Array[i] + 1 



5.2 Combinations 

A combination of a list of elements is a selection of some or all of the elements. 
It is a selection where the sequence of elements in the selection is not important. 
Some combination generating algorithms that have been proposed are Chase [1] , 
Ehrlich [7J |S] and Lam-Soicher [T7]. 
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CA = {1,1,1} 

n = {} 

APR2(1) 



index— 1 
CA = {1,0,1} 
TZ={2} 
APR2(2) 



index— 2 
CA = {0,0,1} 

n = {2, i} 

APR2(3) 
I 



index— 2 
CA = {1,0,0} 
U = {2,3} 
APR2(3) 
I 

index— 3 
CA = {0,0,0} 

n = {2,3, 1} 

APR2(4) 

I 

index— 4 



index— 1 
CA = {1,1.0} 
TZ = {3} 
APR2(2) 

index— 2 
CA = {0,1,0} 

n = {3, i} 

APR2(3) 

index— 3 
CA = {0,0,0} 

n = {3, 1,2} 

APR2(4) 

index— 4 



n = {2,3,1} 



n = {3, 1,2} 



Figure 2: Recursion Tree of APR2: C = {1, 2, 3}; r = 3 and O = {1, 2, 



CA = {1,1,1} 
K = {} 
APR3(1) 



index — 1 
CA = {0,1,1} 
Tl = {a} 
APR3(2) 




index — 2 
CA = {0,0,1} 
■R = {a,b} 
APR3(3) 

index — 3 



index — 2 
CA = {0,1,0} 
K = {a, c} 
APR3(3) 
I 

index — 3 



index — 1 
CA = {1,0,1} 
TZ = {b} 
APR3(2) 

index — 2 
CA = {1,0,0} 
1Z = {b, c} 
APR3(3) 

index — 3 



■R. = {a,b} 



Tl = {a, c} 



n = {&, c} 



index — 1 
CA = {1.1.0} 
K={c} 
APR3(2) 



Figure 3: Recursion Tree of APR3 for C = {a, 6, c}; r = 2 
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We would require to make just one change to Procedure [2] to be able to 
produce all possible r-combination^| of an input list. The new procedure shall 
be referred to as APR3 (shown in Procedure 01 . All we do is - before assigning 
an element at a particular index in 7Z, we make sure that the element occupies a 
position in O that is greater than the position of element at the previous index, 
in O. At the first index however, we may assign any element. The change is 
in Line [8] of Procedure |4j Count Array is built in exactly as was illustrated in 
Procedure [TJ 

By changing, the condition i > Xo(TZ[index — 1]) to i > Io{lZ\index — 1]), 
we can get combinations which contain no repeated elements. The recursion 
tree obtained when we generate all combinations of C — {a, 6, c} with r = 2 and 
O = {a, b, c} is shown in Figure [3] 



Procedure 4 : APR3(index) - Combinations 

1: Local: index 

2: Global: CountArray, O, r, p and 1Z 

3: if index > r then 

4: Print H[l, r] 

5: return 

6: else 

7: for i 1 to p do 

8: if CountArray[i] > 1 and (index = 1 or i > Zo(lZ[index — 1])) then 

9: 1Z [index] <r- Oj 

10: Count Array[i] Count Array[i] — 1 

11: APR3(index + l) 

12: Count Array[i] Count Array[i] + 1 



As is evident from Figure|31 there can be many wasteful branches with certain 
kinds of input to APR3. For example, if we were to have C = {a, 6, . . . , y, z} 7 
r = 26 and O = {a, b, . . . , y, z}, we would have a recursion tree similar to 
Figure SI where branches ending with denote wasteful branches, i.e. recursion 
branches that do not produce any output. All unnecessary computations are 
because we loop from o\ . . .o v at every index. However, based on two simple 
observations, we can completely eliminate all wasteful branches 

1. Once o x is assigned at a particular index, subsequent indices can only 
contain one of o x +\ . . . o p 

2. In the case where we have already assigned elements to the first (3 indices 
of 1Z, and TZ[/3] — o y , we need not loop if we are sure that there are fewer 
than (r — j3) elements left over, i.e., if Sf=y Count Array [i] < (r — /?), 
we can terminate looping in the current recursion level and return to the 
previous recursion level 



5 Dcfinition analogous to definition of r-permutation. 
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index — 1 
CA = {0,1, ...,1} 
TZ = {a} 
APR3(2) 



index — 26 
CA = {0,0,. . . ,0} 
TZ — {a, b, . . . , y, z} 
APR3(27) 
I 

index = 27 



CA = { 1 , . . . , 

n = {} 

APR3(1) 




index — 1 
CA = {1,0,1, ...,1} 
TZ={b} 
APR3(2) 



index — 25 
CA = {1,0,..., 0} 
TZ — {b, c, . . . ,y, z} 
APR3(26) 



index — 1 
CA = {1,...,1,0} 
TZ = {z} 
APR3(2) 





TZ — {a, 6, . . . , y, z} 



Figure 4: Recursion Tree of APR3 for £ 
{a,b,...,z} 



— {a, b, c, . . . , z}; r — 26; O = 



CA = {1,1,1,1} 

n = {} 

APR3 n (l) 



index— 1 
CA = {0,1,1,1} 
n = {a} 
APR3 n (2) 





index— 2 
CA = {0,0,1,1} 
n = {a,b} 
APR3 n (3) 



index— 3 
CA = {0,0,0,1} 
1Z = {a, b, c} 
APR3 n (4) 
I 

index— 4 



index— 3 
CA = {0,0,1,0} 
TZ = {a, b, d} 
APR3 n (4) 

index —4 



Tl = {a, 6, c} 



Tl = {a, 6, d} 



index— 2 
CA = {0,1,0,1} 
H = {a, c} 
APR3 n (3) 

index— 3 
CA = {0,1,0,0} 
TZ — {a, c, d} 
APR3 n (4) 

index— 4 



TZ — {a, c, d} 



index— 1 
CA = {1,0,1,1} 
TZ={b] 
APR3 n (2) 
I 

index— 2 
CA = {1,0,0,1} 
TZ = {b, c} 
APR3 n (3) 
I 

index— 3 
CA = {1,0,0,0} 
TZ = {b,c,d} 
APR3 n (4) 
I 

index— 4 



TZ= {b, c, d} 



Figure 5: Recursion Tree of APR3 n for C — {a, b, c, d}; r = 3; O — {a, b, c, d} 
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Procedure 5 : Building CumCA 
1: CumCA[l] <- 
2: for i <— 1 to p do 

3: C™C4[j + !]■<- CumCA[i] + Count Array[i] 



Procedure 6 : APR3 n (index) - Combinations Efficiently 

1: Local: index 

2: Global: CumCA, CountArray, O, r, p and 1Z 

3: if index > r then 

4: Print U[l, r] 

5: return 

6: else 

7: i <- Io(1Z[index — 1]) 

8: while (CumCA[p + 1] — CumCA[i + 1] + Count Array[i}) > (r — index) 
do 

9: if CountArray[i] > 1 then 

10: lZ[index] <— Oj 

11: Count Arrat/[i] <— CountArray[i] — 1 

12: APR3 n (mdex + 1) 

13: CW Array [i] Count Array[i] + 1 

14: i «- « + 1 



This means that in Line [7] of Procedure HI the loop limits would need to be 
made tighter. For simplicity, we modify the definition of I® (element) to return 
if element ^ O. Also, we assign to 7£[0] an element that does not exist in O. 
This would mean that Zo(7£[0]) = 0. Now, Point 1 indicates that we can begin 
looping from Xa(TZ[index — 1]) instead of 1. 

To incorporate Point 2, we build a Cumulative Array [2] on CountArray. 
It is called CumCA and is of size (p + 1). It is built such that (CumCA[y + 
1] — CumCA[x}) = Xa=x Count Array[i]. A quick implementation is shown 
Procedure El 

It is to be noted that CumCA reflects CountArray values at the beginning. 
We know that CountArray values are constantly changing. Thus we can use 
CumCA only for indices where CountArray values are known to have not 
changed. Procedure [6] is a pseudocode representation of the modified version 
of APR3. This is denoted by APR3 n . The effectiveness of APR3 n is made 
apparent in Figure [5j Analogous to the argument in Section 14. 3[ the running 
time of APR3 n is in 0((n - r + 1) x r x |C r (£)|) 

5.3 Catalan Families 

In this section, we show how the algorithm can be modified to output all possible 
valid parenthesizations of (n + 1) factors. A valid parenthesization of (n + 1) 
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factors can be defined as a sequence of n opening brackets and n closing brackets 
with the condition that at no point in the sequence should the number of closing 
brackets be greater than the number of opening brackets. We shall refer to the 
set of all possible valid parenthesizations of (n+1) factors as n-Parenthesizations. 
Some previous work can be found in [SJ . 



Procedure 7 : Building Count Array 
1: Input: n 

2: Output: Count Array 

3: C ountArray[\] <— n 
4: Count Array [2] <— n 



Procedure 8 : APR4(miiex) - n-Parenthesizations 



1 


Local: index 


2 


Global: CountArray, n, O and 1Z 


3 


if index > 2n then 


1 


Print U[l,.. .,2n] 


5 


return 


6 


else 


7 


for i = 1 to 2 do 


8 


if (i ^ 2 or CountArray[2] > Count Array[l\) and Count Array[i] > 
1 then 


9 


lZ[index] Oj 


10 


CountArray[i] <— CountArray[i] — 1 


11 


APR4(index + 1) 


12 


Count Array[i] <— Count Array[i] + 1 



The input to the algorithm is n. We initialize CountArray such that 
CountArray[l] = CountArray[2] — n. This can be viewed as an input list 
of n opening brackets and n closing brackets (p — 2). We then generate all pos- 
sible permutations of this input list using CountArray and make sure that at no 
point in a permutation we assign more closing brackets than opening brackets. 
Wesetoi ="(" and o 2 = ")" ■ 

Once again, we see that we generate a different combinatorial structure us- 
ing pretty much the same algorithm. The building of CountArray is shown 
in Procedure [7] and the recursive generation of parenthesizations is shown in 
Procedure [51 The recursive procedure is called APR4. 

5.4 Subsets 

True to the formula 2" = (1) > we know that all subsets of a list can 

be generated using an algorithm that generates all r-combinations. This is 
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Procedure 9 : APR5(m<ie2;) - Subsets 

1: Local: index 

2: Global: CountArray, O, n, p and 1Z 

3: Print TZ[1, . . . , index] 

4: if index > n then 

5: return 

6: else 

7: for i ■<— 1 to p do 

8: if CountArray[i] > 1 and (index = 1 or i > la(TZ[index — 1])) then 

9: lZ[index] <r- Oj 

10: Count Array[i] <— Count Array[i] — 1 

11: APR5(index + 1) 

12: CountArray[i] <— CountArray[i] + 1 



achieved by calling the r-combinations generating algorithm successively with: 
< r < n. However, the structure of APR allows us to do it more efficiently. 

All subsets of a list of n elements can be obtained by calling APR5 (shown 
in Procedure It is a modified version of APR3 (shown in Procedure [3J with 
r = n and the Print statement before the if block. CountArray is again built 
exactly as was shown in Procedure [TJ 

5.5 Integer Compositions and Partitions 

A composition of an integer n is a set of strictly positive integers which sum up 
to n [10]. For example, 3 has four compositions - {{1, 1, 1}, {1, 2}, {2, 1}, {3}}. 
Generally, a composition of n can contain any number from 1 to n. However, 
it is also interesting to study a variation of the problem wherein we are not 
allowed to use all numbers. 



Procedure 10 : APR6(mdea;) - Integer Compositions 

1: Local: index 

2: Global: n, O, p and TZ 

3: if n = then 

4: Print (index - 1)] 

5: return 

6: else 

7: for i <!— 1 to p do 

8: if n > o, then 

9: lZ[index] o, : 

10: n <— n — Oi 

11: APR6(mefex + 1) 

12: n ^— n + Oi 
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We now present an algorithm (APR6), based on the same algorithmic struc- 
ture used so far, that given an n and O = {oi, . . . , o p }, generates all possible 
compositions of n using elements in O. It is shown in Procedure 1101 An ex- 
ample recursion tree is shown in Figure [5] It is easy to modify Procedure [TU] 
to generate all integer partitions, instead of integer compositions (analogous to 
how we generated combinations by using a permutations generating algorithm) . 

It is possible that when given an O = {oi, . . . , o p }, we have a limited number 
of some or all of the numbers. For example, we could have n = 15; O = 
{1, . . . , 15} with the condition that no number can be used more than twice. 
This can be handled by maintaining an auxilary array Count Array parallel to 
O. 



indc 



n = {3,3} 

APR6(3) 
I 

index — 3 
n = 
U = {3,3,3} 
APR6(4) 
I 

index — 4 




TZ = {3,3,3} 



K = {3,2,2} 
APR6(4) 
I 

index — 4 
n = 
1Z = {3, 2, 2, 2} 
APR6(5) 



K - {3, 2, 2, 2} 



n — ' 
K = {2, 3} 
APR6(3) 

index — 3 
n = 2 
TZ = {2,3,2} 
APR6(4) 

index — 4 
n = 
11 = {2, 3, 2, 2} 
APR6(5) 



index — 2 

n = 5 
K = {2,2} 
APR6(3) 

index — 3 
n = 2 
U = {2,2,3} 
APR6(4) 

index — 4 
n = 
Tl = {2, 2, 3, 2} 
APR6(5) 

index — 5 



index — 'I 
n — 5 

n = {2, 2} 

APR6(3) 
I 

index — 3 
n — 3 
= {2,2,2} 
APR6(4) 
I 

index — 4 
n = 
7?. = {2, 2, 2, 3} 
APR6(5) 
I 

index — 5 



= {2, 3, 2, 2} U = {2, 2, 3, 2} 



K = {2, 2, 2, 3} 



Figure 6: Recursion Tree of APR6 for n = 9; O = {3, 2} 



6 Conclusion 

Algorithms to generate combinatorial structures will always be needed, simply 
because of the fundamental nature of the problem. One could need different 
kinds of combinatorial structures for different kinds of input. Thus, having one 
common effective algorithm, as the one proposed in this paper, which solves 
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many problems, would be useful. 

One must note that a few other non-trivial adaptations - generating all palin- 
dromes of a string, generating all positive solutions to a diophantine equation 
with positive co-efficients, etc, are also possible. 

7 Future Research 

The proposed algorithm conclusively solves quite a few fundamental problems in 
combinatorics. Further research directed towards achieving even more combina- 
torial structures while preserving the essence of the proposed algorithm should 
assume topmost priority. 

One could also perform some probabilistic analysis and derive tighter average 
case bounds on the runtime of the algorithms. 
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